AIbase
Home
AI Tools
AI Models
MCP
AI NEWS
EN
Model Selection
Tags
Complex scene understanding

# Complex scene understanding

Pixtral 12b
Apache-2.0
Pixtral is a multimodal model based on the Mistral architecture, capable of processing both image and text inputs to generate detailed textual descriptions.
Image-to-Text Transformers
P
mistral-community
31.93k
90
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
English简体中文繁體中文にほんご
© 2025AIbase